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ABSTRACT 


An (associative) algebra is a vector space over a field equipped with an associa- 
tive, bilinear multiplication. By use of a new bilinear operation, any associative algebra 
morphs into a nonassociative abstract Lie algebra, where the new product in terms of the 
given associative product, is the commutator. The crux of this paper is to investigate the 
commutator as it pertains to the general linear group and its subalgebras. This forces 
us to examine properties of ring theory under the lens of linear algebra, as we determine 
subalgebras, ideals, and solvability as decomposed into an extension of abelian ideals, and 
nilpotency, as decomposed into the lower central series and eventual zero subspace. The 
map sending the Lie algebra L to a derivation of L is called the adjoint representation, 
where a given Lie algebra is nilpotent if and only if the adjoint is nilpotent. Our goal is 
to prove Engel’s Theorem, which states that if all elements of L are ad-nilpotent, then L 


is nilpotent. 
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Chapter 1 


Introduction 


Abstract Lie algebras are algebraic structures used in the study of Lie groups. 
They are vector space endomorphisms of linear transformations that have a new operation 
that is neither commutative nor associative, but referred to as the bracket operation, or 
commutator. Recall that an associative algebra preserves bilinear multiplication between 
elements in a vector space V over a field F’, where V x V — V is defined by (x, y) > xy for 
x,y € V. The bracket operation yields a new bilinear map, over the same vector space, 
that turns any associative algebra into a Lie Algebra, where V x V > V is now defined 
by [z,y] > (ay — yx). Central to the study of Lie algebras are the classical algebras, 
which will be explored throughout this paper to derive isomorphisms, identify simple 
algebras, and determine solvability. After examining some natural Lie algebra examples, 
we will delve into the derived algebra, which is analogous to the commutator subgroup of 
a group, and make use of it to define a sequence of ideals called the derived series, which 
is solvable. We will then use a different sequence of ideals called the descending central 
series to classify nilpotent algebras. All of this will arm us with the necessary tools to 
prove Engel’s Theorem, which states that if all elements of D are ad-nilpotent, then L is 
nilpotent. We must first recall some basic group axioms, ring axioms, and vector space 


properties as a precursor to the study of Lie algebra. 


The study of Lie Algebra requires a thorough understanding of linear algebra, 
group, and ring theory. The following provides a cursory review of these subjects as 
they will appear within the scope of this paper. Unless otherwise stated, our refresher 
for group theory was derived from Aigli Papantonopoulou’s text Algebra Pure and 


Applied [Aig02]. 


Definition 1.1. An associative algebra is an algebra A whose associative rule is associa- 


tive: x(yz) = (xy)z for allz,y,zEA 


Definition 1.2. A nonempty set G equipped with an operation * on it is said to form 
a group under that operation if the operation obeys the following laws, called the group 


axioms: 
e Closure: For any a,b € G, we haveaxbeG. 
e Associativity: For any a,b,c € G, we have ax (b*c) = (a*b) *c. 
e Identity: There exists an element e € G such that for alla € G we have axe = 


exa=a. Such an element e € G is called an identity in G. 


e Inverse: For each a € G there exists an element a~! € G such that axa! = 


a~'xa=e. Such an element a7! € G is called an inverse of a in G. 


Proposition 1.3. (Basic group properties) For any group G 


e The identity element of G is unique. 
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e For each a € G, the inverse a_~ is unique. 


e For anyaéG, (a!)"'=a 
e For any a,b € G, (ab)-| =b-'a7} 


e For any a,b € G, the equations ax = b and ya = b have unique solutions or, in 


other words, the left and right cancellation laws hold. 


Example 1.0.1. The general linear group, denoted gl(n), consists of the set of invert- 
ible n x n matrices. By the above, given that multiplication of invertible n x n matrices 
is associative, and each invertible matrix has an inverse and an identity, gl(n) forms a 


group under multiplication. 


Definition 1.4. nonempty subset H of a group G is a subgroup of G if H is a group 
under the same operation as G. We use the notation H C G to mean that H is a subset 
of G, and H <G to mean that H is a subgroup of G. For a group G with identity element 
e, {e} is a subgroup of G called the trivial subgroup. For any group G, G itself is a 
subgroup of G, called the improper subgroup. Any other subgroup of G besides the two 


above is called a nontrivial proper subgroup of G. 


Definition 1.5. The number of elements in a group G is called the order of G and is 
denoted |G|. G is a finite group if |G| is finite. If a € G, the order |a| of a in G is 
the least positive integer n such that a” = e. If there exists no such integer n, then |a| is 


infinite. 


Definition 1.6. A map »:G—> G' from a group G to a group G’ is called a homo- 
morphism if p(ab) = y(a)y(b) for all a,b € G (in —p(ab) the product is being taken in 
G, while in p(a)y(b) the product is being taken in G’). 


Proposition 1.7. (Basic group homomorphism properties) Let py : G — G' be a 


homomorphism. Then 
e v(e) =e’, where e is the identity of G and e’ the identity of G’. 
© y(a') = (~(a)) + for anyacG 


e y(a”) = y(a)” for anyne Z 


If |a| is finite, then |p(a)| divides |al. 
e If H is a subgroup of G, then y(H) = {y(x)|a € H} is a subgroup of G’ 
e If K is a subgroup of G’, then p-'(K) = {x € Gly(x) € K} is a subgroup of G. 


Example 1.0.2. For any group G, the identity map is always a homomorphism, since 


if p:G—-G’ is the identity p(x) = x, then p(xy) = ry = v(x) y(y) 


Definition 1.8. For the homomorhpism y, the kernel of y is the set {x € Gly(x) = e'}, 
denoted Ker vp. 


Definition 1.9. Let G be a group, H a subgroup of G, anda eéG. Then the set aH = 
{aH|h © H} is called a left coset of H in G, and the set Ha = {halh € H} is called a 
right coset of H inG. 


Definition 1.10. The group consisting of the cosets of H in G under the operation 
(aH)(bH) = (ab)H is called the quotient group of G by H, written G/H. 


Definition 1.11. If G is any group, then the center of G, denoted Z(G), consists of the 


elements of G that commute with every element of G. In other words, 
Z(G) ={%#EGlay=yx VyeG} Cie h) 


Note that ey = y = ye for ally € G, soe € Z(G), and the center is a nonempty subset of 
G. Letaeé G. Then the centralizer of a in G, denoted Cg(a), is the set of all elements 


of G that commute with a. In other words 
Ca(a) = {y € Glay = ya} (1.2) 


Note that for anya € G we have Z(G) C Cg(a). In other words, the center is contained 


in the centralizer of any element [Aig02/. 


Definition 1.12. Let G be a group and H a subgroup of G. If for allg € G we have 
gH = Hg, then we say H is anormal subgroup of G and write HAG. (Recall the normal 
subgroup test: A subgroup H of G is normal in G if and only if xHa~! C H for all x in 
G). 


The following material for group and ring theory was derived from Joseph A. 


Gallian’s Contemporary Abstract Algebra [Gal04]. 


Definition 1.13. An isomorphism from a group G onto itself is called an automor- 


phism of G. 


Definition 1.14. Let G be a group, and leta€ G. The function pq defined by va(x) = 


axa! for all x € G is called the inner automorphism of G induced by a. 


Definition 1.15. A set R equipped with two operations, written as addition and multi- 
plication, is said to be a ring if the following four ring axioms are satisfied, for any 


elements, a,b, and c in R: 
e Ris an Abelian group under addition. 


e Closure: abe R 


e Associativity: a(bc) = (ab)c. 
e Distributivity: a(b+c) = ab+ac and (b+ c)a = ba+ca. 


Definition 1.16. A subring A of a ring R is called a (two-sided) ideal of R if for every 


r€ Rand everya€ A both ra and ar are in A. 


Theorem 1.17. Ideals are Kernels Every ideal of a ring R is the kernel of a ring 
homomorphism of R. In particular, an ideal A is the kernel of the mappingr >r+A 
from R to R/A. 


Definition 1.18. A unity (or identity) in a ring is a nonzero element that is an identity 
under multiplication. A nonzero element of a commutative ring with unity that has a 


multiplicative inverse is called a unit of the ring. 


Theorem 1.19. First Isomorphism Theorem for Rings Let y be a ring homomor- 
phism from R—- S. Then the mapping from R/kery to p(r), given by r+ kerp > y(r) 


is an isomorphism. In symbols, R/kery = y(r) 


Theorem 1.20. Second Isomorphism Theorem for Rings If A is a subring of R 
and B an ideal of R, then ANB is an ideal of A and A/ANB is isomorphic to (A+ B)/B. 
(Recall that {A+ B=a+bla€ A,be€ B}) 


Theorem 1.21. Third Isomorphism Theorem for Rings Let A and B be ideals of 
a ring R with BC A. A/B is an ideal of R/B and (R/B)(A/B) is isomorphic to R/A. 


Theorem 1.22. Correspondence Theorem Let I be an ideal of a ring R. There exists 
a bijection between the set of all ideals J of R such that IC J and the set of all ideals of 
fot: 


{J|I is an ideal of R,ICJ}—>{K|K_ is an ideal of r/I} 
J—> J/I 
Definition 1.23. In a ring R the characteristic of R, denoted char R, is the least 


positive integer n such thatn-a = 0 for alla € R. If no such n exists, we say char 


R=0. 


Definition 1.24. A field is a commutative ring with unity in which every nonzero ele- 


ment 1s a unit. 


Unless otherwise stated, the material for our vector space review was derived 


from Stephen H. Friedberg et al’s, Linear Alebra [FIS03}. 


Definition 1.25. Let F be a field. A set V equipped with two operations, written as 


addition and multiplication, is said to be a vector space over F if 
e Vis an Abelian group under addition. For alla,b € F and allu,veéeV, 
e The product av € V is defined 
e a(fu+w) =av+aw 
e a(bv) = (ab)uv 
elv=v 


An element of v € V is called a vector. The identity element of V under addition is 
called the zero vector and written 0. An element of a € F is called a scalar, and the 


operation of forming av is called scalar multiplication [Aig02/. 


Definition 1.26. A subset U of a vector space V over a field F is called a subspace of 
V if U is a vector space over F' with the operations of addition and scalar multiplication 


defined on V 


Example 1.0.3. [fT : V > W is a linear map, the kernel of T and the image (range) 
of T defined by 

ker(T) ={x@ €W : T(x) = 0} 

im(T) ={weEewWw:T(a),c¢ew} 


are subspaces of V. 


Definition 1.27. Let V be a vector space over a field F. A vector space homomorphism 


that maps V to itself is called an endomorphism of V 


Definition 1.28. Let V and W be vector spaces. We say that V is tsomorphic to W if 
there exists a linear transformation T: V — W that is invertible. Such a transformation 


is called an tsomorphism from V to W. 


Definition 1.29. Let V be a vector space over C. An inner product on V is a function 
that assigns, to every ordered pair of vectors x and y in V, a scalar in C, denoted (x,y), 


such that for all x,y,z in V and all c in C, the following hold: 
1. (x +2,y) = (x,y) + {z,y) 


2. (ery = Ch y) 


3. (x,y) = (y, z) 
4 ae) SO af ee 


Definition 1.30. Let V be a vector space over a field F. A function f from the set V x V 
of ordered pairs of vectors to F is called a bilinear form on V if f is linear in each 


variable when the other variable is held fixed; that is, f is a bilinear form on V if 
1. flax, + %2,y) =af(r1,y) + f(z2,y) 


2. f(z,ay1 + y2) = af (x, y1) + f(x, y2) 


Any nondegenerate bilinear form on V consists of all operators 7 on V un- 
der which the form f is infinitesimally invariant, i.e., that satisfies f(a(v),w) + 
f(v, z(w)) = 0. [FIS03] 
The kernel of a symmetric bilinear form is given as: kerf = {v: f(v,w)=0 forall weV}. 


That is, for all v € W, there exists a w € V such that f(v,w) = 0. 


Definition 1.31. An alternating space is defined as the set of all vector spaces V and 


alternate bilinear forms such that f : V x V > F where: 
1. f(z,y +z) = f(x,y) + f(a, 2) 
2. f(xt+y,2) = f(,2) + fy, 2) 
3. f(ax,y) = af(x,y) = f(x, ay) 
4. (a,x) =0 which implies that f(x, y)=-f(y,2) 


Definition 1.32. In matrix terms, an n-linear function y : Mnxn(F) > F is called 
alternating if, for each A € Mnxn(F), we have y(A) = 0 whenever two adjacent rows 


of A are equal. [FIS03] 


Theorem 1.33. Let A be anm xn matriz, B and C ben x p matrices and D and E be 


q xm matrices. Then 
1, A(B+C)=AB+AC and (D+ E)A=DA+EA 
2. a(AB) = (aA)B = (AaB) 
2 tpAS A= Aly, 
4. If V is an n-dimensional vector space with an ordered basis B, then [Iv|p = In 


Definition 1.34. Let A,B and C be matrices where C = qj, A = ajz, and B = by;. 


Then by matrix multiplication 


n 
C=AB where qj = So andr 
k=1 


The formula for the determinant of A, denoted det(A) or |A|, where A isanxn 


matrix can be expressed as cofactor expansion along the 1st row of A 


det(A) = yyy - det(Aj;) 


j=l 
Where the scalar cj = (—1)'*) det(Aj;) is called the cofactor of the entry of A if row i, 


column 3. 


Example 1.0.4. The Laplace expansion of A where A is a2 x 2 matrix is 
det(A) = Ay (—1)'*1det Ay, + Ajo(—1)!*? det Ajy 


Definition 1.35. (Adjoint of a Matrix) Let A be ann x n matrix. The matrix B = [bj;| 
with bij = cj: (for ci as defined in Definition 1.33), for 1 <i,j <n is called the Adjoint 
of A, denoted Adj(A). 


Definition 1.36. The trace of a matrix is the sum of its diagonal entries. 


Definition 1.37. Given ann x n matrix A = {aj;}, the transpose of A is the matrix 


A? = {biz}, where bij = Ajj. 


Definition 1.38. If the transpose of a matrix is equal to the negative of itself, the matrix 
is said to be skew symmetric. This means that for a matrix to be skew symmetric, 


At =—A. 


Definition 1.39. Let F' be a field and V,W vector spaces over the field F. A function 
T:V > W is said to be a Linear transformation from V to W if for allc,d € F, and 
allxz,y € V we have 


T(cx + yd) = cT(x) + dT(y) 
The following are simple properties of linear transformations: 
e T(0) =0 
e T(cx+y) =cT(x)+T(y) 
¢ T(x—y) =T(a«) - Ty) 


e For 71, 22,...,2n € V and aj, a9,...,an € F’, we have 
n nr 
o> Cn = Ss" Gis (a3) 
i=l i=l 


Example 1.0.5. Consider the map T : Mnxn(F) 4 Mnxm(F) defined by T(A) = At. 


We can show that T is a linear transformation. Let A= Aj; and B= Bi; 


(A+ B)’ = (Aij + Bij)’ = (Aiz)’ + (Biy)’ = AP + BY 
T(cAij) = (cAiz)’ = c(Aiy)” = cA" 


Definition 1.40. A basis, 6, for a vector space V is a linearly independent subset of V 
that generates V. If there exists a finite set that forms a basis for V over a field F, then 
the number n of vectors in such a basis {v1...Un} is called the dimension of V over F, 
written dim Fy. If there exists no finite basis for V over F, then V is said to be infinite 


dimensional over F. 


Definition 1.41. Let T=T,4:V —> V be a linear transformation, where V is a finite- 
dimensional vector space over a field F, and let0 #vu € V be such that Ta(v) = Av for 
some \€ F. Then v is called an eigenvector of T (and of A), and X the corresponding 
eigenvalue of T (and of A). 


Definition 1.42. A is invertible if there exists ann x n matriz B such that AB = 


BA=I. 


10 


Definition 1.43. a matrit M € Mnxn(F) is called orthogonal if MM‘ = I. 


Theorem 1.44. Jf A is a square matrix of order n > 1, then A(adjA) = (adjA)A = 
|A|I,. The (i,7)th element of A(adjA) is: 


JA] ei=3 
0SiF¢j 


S| aipday = >> aij Ajax = 

k=1 k=1 
Therefore A(adjA) is a scalar matrix with diagonal elements all equal to | Al. 
Where |A| represents the determinant of A.[Eve66]/ 


Proposition 1.45. The following equation denotes the inverse of A: 


_ AdjA 


Avi 
[A] 


Definition 1.46. /f A is a square matrix of order n, then the X matrix [A — AI(n)] 
is called the characteristic matrix of A. The determinant |A — XI(p)| 1s called the 


characteristic determinant of A, and the expansion of it is a polynomial of degree n: 
FA) = (- 1)" = pr! + por”? +. — 1)" Dr] 
[Eve66] 


Theorem 1.47. In the characteristic function f(A) of matriz A, py = |A| and p; = trA. 
[Eve66] 
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Chapter 2 


Definitions and First Examples 


2.1 The Notion of a Lie Algebra and Linear Lie Algebras 


Let R be a fixed commutative ring (or a field). An associative R-algebra is 
a group A that is abelian under addition, and has the structure of both a ring and 
an R-module such that the scalar multiplication satisfies r- (xy) = (r-a)y = a(r-y) 
for allr € Rand w,y € A. Additionally, A contains a unique element 1 such that 
l-¢ =x2=2-1 forallxz €¢ A. A is therefore an R-module together with (1) an R- 
bilinear map A x A — A, called the product, and (2) the multiplicative identity, such 
that multiplication is associative: x(yz) = (xy)z, for all x,y, and z in A. If one negates 
the requirement for associativity, then one obtains a non-associative algebra. If A itself is 
commutative, it is called a commutative R-algebra. The commutator of two elements x 
and y of a ring or an associative algebra is defined by xy—yx. (The anticommutator of two 
elements x and y of a ring or an associative algebra is defined by {x,y} = xy + yx). Any 
algebra A over F’, where F is a vector space with associative multiplication can be made 
into a Lie algebra LD via the commutator, yielding a structure similar to a ring modulo 
L. The commutator is also referred to as the the bracket operation. In order to prove 
Engel’s Theorem, we shall restrict ourselves to the following definition of abstract Lie 
algebra, in contrast to the concrete Lie algebra definition featured in universal enveloping 
algebras and the Poincare Birkhoff Witt Theorem. Additionally, we will assume all base 
fields have characteristic 4 2. Unless otherwise stated, the material for this chapter is 


derived from [Hum72]. 
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Definition 2.1. A vector space L over a field F, is a Lie algebra if there exists a bilinear 


multiplication L x L + L, with an operation, denoted (x,y) +> [xy], such that: 


1. It is skew symmetric where |x, x] = 0 for all x in L (this is equivalent to 


[x,y] = —[y, xz] since character F' 4 2). 
2. It satisfies the Jacobi identity |x[yz]] + [y|za]] + [z[ry]] =0 (2, y, 2 € L). 


Example 2.1.1. Given ann dimensional vector space End (V), the set of all all linear 
maps V ++ V with associative multiplication (x,y) > xy for all x,y, where xy denotes 
functional composition, observe, End (V) is an associative algebra over F'. Let us define 
a new operation on End (V) by (a, y) 4 xy — yx. If we denote xy — yx by [x,y], then 
End (V) together with the map |-,-] satisfies Definition 2.1, and is thus a Lie algebra. 


Proof. The first two bracket axioms are satisfied immediately. The only thing left to 
prove is the Jacobi identity. Given x,y,z € End (V), we have by use of the bracket 


operation: 


[z[yz]] + [y[z2]] + [z[zy]] =0 


= x(yz — zy) — (yz — zy)a + y(zu — xz) — (za — wz)y + 2z(xy — yx) — (ay — yx)z 


= XYZ — LZY — Y2X A ZYX + YSU — YXZ — eXY + UZY + ZXLY — ZyUL — LyzZ + yLZ 


= (xyz — xyz) + (awzy — ezy) + (yzu — yzx) + (zyx — zyx) + (yxz — yxz) =0 


Definition 2.2. The Lie algebra End (V) with bracket [x,y] = xy — yx, is denoted as 
gl(V), the general linear algebra. 


Example 2.1.2. We can show that real vector space R° is a Lie algebra. Recall the 
following cross product properties when a,b and c represent arbitrary vectors and a, 2 


and y represent arbitrary scalars: 
1.axb=—-(bx a), 


2. ax (8b+ yc) = B(a x b) + ¥(a x c) and 
(aa + 6b) x c=a(axc)+ B(bx 0), 
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([MT03]) 

Note, ax a=-—(axa), by property (1), letting b =, therefore, axa=0. By the above 
properties, the cross product is both skew symmetric (property 1) and bilinear (property 
2). By vector triple product expansion: x x (y x z) = y(a-z)—2z(x-y). To show that the 


cross product satisfies the Jacobi identity, we have: 


[z[y, 2]] + [ylz, 2] + [elz, y]] = 2 x (yx z) + yx (2x az) +2~x (xx y) 
= [y(x-z) — 2(x-y)| + [2(y- x) — aly: 2) + [2(z-y) — y(z-2)] 


=0 {Since the dot product is commutative} 


Therefore, by definition 2.1, the real vector space R® is a Lie algebra 


Definition 2.3. A subspace K of L is called a (Lie) subalgebra if |xy| © K whenever 
ryEek. 


2.2 Classical Lie Algebras 


Classical algebras are finite-dimensional Lie algebras. Each classical algebra A), 
Bi, Ci, and D; has an associated algebra, represented by symmetric, skew symmetric, and 
orthogonal matrices. Let s be an n x n matrix. We shall prove that each classical algebra 
representation, equipped with basis and dimension satisfying rs+ sx‘ = 0 (a? =transpose 
of x), is a subalgebra of the linear Lie algebra gl(V). Note: a proper subalgebra maintains 


dimension less than that of gl(n). 


(A,) The set of all endomorphisms of V having trace zero is denoted by sl(V), the 
special linear algebra. Letting x,y € sl(V), we can show that sl(V) is a subalgebra of 


gl(V). 


Proof. 


tr{x,y] = tr(ay) — tr(yx) =0 {Since the trace of a matrix preserves bilinearity} 


ér([a, [y, 2l] + ly, [2,21] + [z, fe, yl) = trl, 0] + trly, 0] + tr[z, 0] = 0 


Therefore, both Lie algebra axioms are satisfied, hence by Definition 2.3, sI(V) is a 
subalgebra of gl(V). 
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The dimension of sl(V) is found by determining the number of linearly independent 
matrices of sl(V) that yield trace 0. By counting the matrices e;;(i 4 7), and adding 
them to the matrices h; = e;; — e:414+41, we find dim s(V) =1+ (1+1)? —(1+1) 


(C,) The set of endomorphisms of V having dim V = 21, that satisfies 
f(v,w) = —f(w,v), is denoted as sp(V), the symplectic algebra. Let s bea 


: ‘ l ; 
nondegenerate, skew symmetric matrix where s = . Recall that a matrix 
ny 7 


A€ Mnxn(F) is called skew-symmetric if A‘ = —A. We can show sp(2l, F) is a 


m 


subalgebra of gl(V) if we partition x as 7 = ‘ where m,n,p,q € gl(v). 


P 4q 


Proof. Given x,y € sp(V), 


f([zy](v), w) + fF, [zy] (w)) 

=f((zy — yx)v,w) + f(v,(vy—yx)w) {By definition of the bracket operation} 
=f(xy(v),w) — f(yx(v),w) + f(v,zy(w)) — f(v,ya(w)) {By definition of bilinearity} 
=f(xyr)), wv) — Fy@(r)), w) + Fv, ew) — Fe, (Y(@)) 

=— fy), ew) + F(a), y(w)) — Fae), yw) + F(y(v), e(w)) {By skew symmetry} 
=0 

Therefore [x,y] € sp(V), hence sp(V) is a subalgebra of gl(V) when sp(v) is of even 


dimension. We show that sp(V) requires even dimensionality by considering the 


determinant of x when n is odd. From sx = —2‘s, 


det(sx) = det(—a's) <= + det(s)det(x) = det(—zx')det(s) 
<= det(x) = det(—z') 
= (-1)"det(z") 
= (—1)7*-ldet(z’) 
= —det(z') 


but then det(x) = —det(x‘), therefore, det(x) = 0 


(Bj) The set of all endomorphisms of V with dimension 2/? + | satisfying 


f(x(v), w) = —f(v, x(w)), such that f is a non-degenerate bilinear form on V with 
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1 0 O 
matrix s=]|0 0 J, |, corresponds to the orthogonal algebra, denoted as o(V), or 
0 Lt, O 
0(21+ 1, F). Recall a matrix A € Myxn(F) is called orthogonal if AA‘ = I. We can 
a by bo 
partition x as x= ]|c, m_n |. The orthogonal algebra is a subalgebra of gl(V), 
c2 p 4 


where f(x(v),w) = —f(v,xz(w)) (the same conditions as for (C}). 


Example 2.2.1. When V = F", take for f(x(v),y) the form S~ XiY = x'y where 
X = (£1, %2,...,%n) and Y = (y1, Y2,--, Yn). The corresponding Lie algebra is o(n, F’) 


0 L 
(D,) Let D; be an an orthogonal algebra with s = '). Let m = 21, then D; 
I; 0 
consists of all the matrices with dimension 2? — 1, satisfying f(a(v),w) = —f(v, x(w)). 


Definition 2.4. The following is a list of several other subalgebras consisting of the 


upper, lower, and triangular matrices of g\(n, F’): 
e Let n(n, F) be the strictly upper triangular matrices (a;;) = 0 ifi > j 
e Let 0(n, F) be the set of all diagonal matrices (a;;), aj; = 0 if i Aj. 
e Let t(n, F) be the set of upper triangular matrices (ajj), aij = 0 ifi> J. 


Example 2.2.2. Given algebras t(n, F),0(n, F), n(n, F'), we can compute the dimension 


of each algebra, by exhibiting a basis. 


0 aji2 oe Qin 

0 0 Bete a2n 
e For n(n, F) = 

O Oe Sees 1G 


Using the formula for the area of a triangle: A = bh the dimension for n(n, F’) is 


n(n-1 
therefore n(n) 
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at1 0 0 

0 age 0 
e For o(n, F) = 

0 0 Cis 


The dimension of o(n, F’) is clearly n. 


Qyi+t a2 -. Gln 
0 a22 “8 ag 
e For t(n, F)= : 
0 0 2k “Ann 
Again using the formula for the area of a triangle, the dimension of t(n, F’) is 
n(n+1) 


2 


Definition 2.5. The (external) direct sum of two Lie algebras L,L', written L @ L' is 
the vector space direct sum, with |-,-] defined “componentwise”: [(1, y1), (X2, y2)| where 


{(x,0)}, {(y,0)} are ideals and thus ”nullify” each other. That is, [L, L'| =0. [Sam90] 


Example 2.2.3. We can prove that t(n, F’) is the direct sum of o(n, F’) and n(n, F). By 
the above definition, we must show [t(n, F’), t(n, F)| = 0 where 
t(n, F) = o(n, F) + n(n, F). 


Proof. Let A € n(n, F’) and B € o(n, F’), and C € t(n, F): 


0 a12 Fy Ain by4 0 Prey 0 bi4 a42 bers Ain 

0 0 Bri a2n 0 boo aes 0 0 boo a6 Qa2n 
+ a 

0 O ::- O 0 O -:: ban 0 O  ... ban 


where A+ B is an upper diagonal matrix. Therefore t(n, F) = o(n, F) + n(n, F’). To 
show AN B = {0}, we have 


(AB)i; = So Ain Bry + Ss" Ain Brj 


k=1 k=i+1 
t n 
=320- Bis + So An-0 
k=1 k=i+1 


Where Aj, = 0 in the first sum since i > k for A strictly upper triangular. For the 
second sum k >i > j—1, thus By; = 0. Therefore AN B = {0}, meaning 
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AB — BA=[AB| =0. Therefore [o(n, F), n(n, F)] C n(n, F). We can construct a basis 
ex; for n(n, F) such that e;; is 1 for the (ij) element, and zero elsewhere. By definition 


of the commutator: 
lei, €ij] = exseig — eigen = Cay 


By the above, when i > j, n(n, F) C [o(n, F’), n(n, F’)], and thus 
[o(n, F’), n(n, F’)] = n(n, F). Since o(n, F) + n(n, F) = t(n, F): 


[o(n, F) + n(n, F),o(n, F) + n(n, F)] = [o(n, F), o(n, F)] + [n(n, F), n(n, F)] 


[n(n, F), o(n, F)] + [o(n, F), n(n, F)] 
=0+04 [o(n, F), n(n, F)] C n(n, F) 


Conversely: 


n(n, F) = [o(n, F), n(n, F)] C [t(n, F), t(n, F)] 


Therefore [t(n, F), t(n, F)] = n(n, F’) and thus t(n, F) = o(n, F) + n(n, F) 


2.3. Lie Algebras of Derivations 


The derivative between two functions f and g is a linear operator that satisfies the 


Leibniz rule: 
1. (fg) =fiot+ fo 
2. (af)’ =af’ where a is a scalar. 


Given an algebra A over a field F’, the derivation of A is the linear map 6 such that 
O(f,g) = fo(g) + 6(f)g for all f,g € A. The set of all derivations of A is denoted by 
Der(A). Given 6 € Der(A), f,g € A, and a € F. By property 2., 


(ad) (fg) = 0d(fg) = a(f5(g) + 6(f)g) = af d(g) + ad(f)g 


where the Leibniz rule is satisfied if and only if af = fa where F is a field. 


Example 2.3.1. Let x,y € End(V), and 6,6’ € Der(V). By definition of the 
commutator, [6, 6’] = (66’ — 6'6): 
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e Der(V) is a vector subspace of End(V): 


d[z,y] = d(ay — yx) = (xy) — 6(ya) 
= x6(y) + d(x)y — (yd(x) + d(y) x) 
= 6(y)x — yd(x) + xd(y) — d(y)x 
= (d(x), y] + [x, 6(y)] 


e The commutator [6, 6'] of two derivations 6,6’ € Der(V) is again a derivation. 


([5,5'](x))y + @([, 5'}(y)) = ((55’ — 0'6) (x) )y + a(50" — 58) (y) 
= (66'(x) — 6'6(x))y + 2(55'(y) — 5'6(y)) 
= 66'(x)y — 5'5(x)y + 266’ (y) — 20'5(y) 
= 6(6'(a)y + «d'(y)) — 5'(5(x)y + #6(y)) 
= 6(5'(x)y) + 5'(x)5(y) + 8(a)d"(y) + 2d(8'(y)) 
— 5'(5(x)y) — 6(x)d'(y) — 5'(x)d(y) — 25'5(y) 


( 
( 


5(5'(x)y + 25'(y)) — 0(6(x)y + 2d(y)) 
= 6(0'(xy)) — 0'(5(xy)) 
= [6,0] (ay) 


Definition 2.6. Given x € L, the map y > [x,y], is an endomorphism of L, denoted 
ad,,, where ad, is an inner derivation. Derivations of the form [x[yz]] = |[xy]z] + [y[xz]] 


are inner. All others are outer. 


Example 2.3.2. By example 2.3.1, the collection of derivations, Der(V), satisfies skew 
symmetry. By definition 2.6, Der(V) satisfies the Jacobi identity. Therefore Der(V) 
defines a Lie algebra. 


Definition 2.7. The map L + DerL sending x to ad, is called the adjoint 


representation of L. 


This is akin to taking the ad homomorphism of g — gl(g). To show ad is a 


homomorphism: 


ad (x,y}) = [ad(z), @d(y)| = ad(z)ad(y) — adi ad x) 
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if and only if: 
[[, y], 2] = (x, y, 21] — ly, [e, zl] 
0 = [z, [y, 2]] + ly, [z, 2]] + lz, [z, yl] 


That is, if and only if the Jacobi identity is satisfied, where [[z, y], z] = —[z, [x, y]] and 
—ly, [z, 2]] = ly, [z, 2] by skew symmetry. 


Definition 2.8. A representation is faithful if its kernal is zero, i.e., if 


o(x) =0@2=0, where the adjoint operator defines the trivial representation. [Sam90] 


2.4 Abstract Lie Algebras 


Let L be a finite dimensional vector space over Ff’. Any vector space can be made into a 
Lie algebra simply by setting [x, y] = 0 for all vectors x and y in L. The resulting Lie 
algebra is called abelian. Recall that a group G under multiplication is said to be 
Abelian if G obeys the commutative law or, that is, if for every a,b € G we have 

ab = ba {Aig02]. If an associative algebra is abelian and therefore commutative, then 


ab = ba = [a,b] = 0, where [a, b] is the associated commutator. 


Example 2.4.1. Given L as a Lie algebra, with [xy] =0 for all x,y € L, 
[zy] = (zy — yx) =0& zy = ya 
Therefore L under trivial Lie multiplication, is abelian. Similarly, |yx] = 0. 


Definition 2.9. If L is any Lie algebra with basis 11,...,2%, then the multiplication 
table for L is determined by the structural constants, the set of scalars {b;;,} such 


that ee aes | = paar bijkXk- 


Example 2.4.2. By use of the bracket axioms, the following set of structural constants 


define an abstract Lie algebra: 
1. dix = 0 = dijn + d5ix 
2. do (bigkOkim + O51kbkIm + bLikdkjim) = 9 


Proof. 
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1. Recall [xa] = 0 for all x € L. 


n 
ae} Gee 
k=1 
= by1r1 + byoxre +... + diintn = 0. 
Then, since 2; is a basis, bj, = 0 for all k. 
2. Given [x;, 2j] = Ty 4 Oijn@R, notice bij~ + b5;, = 0 if and only if 


S (bij + bjik eR = 0: 
i 


=> Ss" Dijk@k + Ss" Djik®k =0s [eee ng [epee =0 
k k 


But: 
[egee| = UX 5 — Li Xy 


[eexyl = Li Xy — LX 5 


& [ages] + [eyxel = LX; — LjX, + UXjX;,— Tx; = 0 


Definition 2.10. The assorted systems of structural constants of a given Lie algebra 
relative to its basis form an orbit (the set of all transformations of one element) under 


this action [Sam90]. 


Example 2.4.3. The orbit of the system cj, =0 for alli,j,k forms a natural 


isomorphism to the abelian algebra [X,Y] =0 of dimension n. 


Example 2.4.4. Consider a Lie algebra L with basis 11, ...t,. The structural constants 
are enumerated as followed: e, = (1,0,0), e2 = (0,1,0), and e3 = (0,0,1). By use of the 


Lie bracket axioms, We have: 


[e1, e2] = e3 
[e1, €3| = =e) 


[e2, €3| =€] 
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Therefore, by definition, the structural constants are: 


bi21 = 0, bi22 = 0, bizs = 1, 
bi31 = 0, bi32 = —1, b133 = 0, 
bo31 = 1, bo32 = 0, ba33 = 0 
Example 2.4.5. We can use structural constants to show that a three dimensional 


vector space with basis (x,y,z) such that [xy] = z, [xz] = y, [yz] = 0 defines a Lie 


algebra. 


Proof. the structural constants are: 


bi21 = bi22 = 0, b123 = 1, 


bi31 = 6133 = 0, bige = 1, 


b931 = b232 = b233 = 0 


Where bijx = bijk — bj: = 0 for all structural constants above (by skew-symmetry). By 


the Jacobi identity, we have: 


[z[yz]] + [eleyl] + [ylze]] = [2,0] + [2,2] + ly. -— y] = 0 +0+0=0 


Therefore the above basis defines a Lie algebra structure over F’. 


Definition 2.11. For any field F’, in the vector space 
F” = {(b1, bo, b3...,bn)|b, EF for 1<i<n} 


a basis is formed by the vectors 


ej = (1,05: ,0) 
€2 = (0,1, ,0) 
C4 =(0,0j-445:1) 


Where {e1, €2,...,€n} is called the standard basis for F” over F. 


Theorem 2.12. Let V and W be finite-dimensional vector spaces (over the same field). 


Then V is isomorphic to W if and only if they are of equal dimension. 
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Corollary 2.13. Let V be a vector space over F'. Then V is isomorphic to F” if and 
only if dim(V) = n [FIS03] 


Example 2.4.6. By Corollary 2.18, there exists an isomorphism between gl(n) and F”. 
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Chapter 3 


Ideals and Homomorphisms 


3.1 Ideals 


Lie algebra ideals are intrinsic to much that follow. In group theory, ideals are used to 
define the kernel of a group, normal subgroups, and the structure of nilpotent and 
solvable groups. In a ring, every kernal is an ideal of a homomorphism, and every 
subgroup is normal. By definition of the normal subgroup, if H is a subgroup of G, for 
all g € G we have gH = Hg © |g, H] = 0. Unless otherwise stated, the material for this 
chapter is derived from |[Hum72]. 


Definition 3.1. A subspace I of a Lie algebra L is called an ideal of L ifxel,yeL 
together imply [x,y] € I. The construction of ideals in Lie algebra is analogous to the 


construction of normal subgroups in group theory. 


By skew-symmetry, all Lie algebra ideals are automatically two sided. That is, if 
[x,y] € I, then [y,xz] € I. The kernel of a Lie algebra L and L itself are trivial ideals 


contained in every Lie algebra. 

Example 3.1.1. The set of all inner derivations ad,, x € L, is an ideal of Der(L). Let 

6 € Der(L). By definition of inner derivations, for all y € L: 

[5, adz](y) = (6(adz) — (adz)d)(y) = d[a, y] — ade(5(y)) 

[5(@), y] + [x, 6(y)] — [x, 6(y)] {Since dla, y] = [6(x), y] + [x 5(y)It 
= ad(5(x)y) 


Therefore, by Definition 3.1, ad, is an ideal of Der(L). 
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Definition 3.2. The center of a Lie algebra L is the set 
Z(L) = {z € L|[z, z] =0 for all c € L} 


The center respresents the collection of elements in L for which the adjoint action ad,,) 
yeilds the zero derivation. It can therefore be thought of as the kernel of the adjoint 
map from L to Der(L). Clearly, the center is an ideal, therefore Z(L) satisfies skew 
symmetry. Given y € L, [z, [z, y]] = —[z, |y, z]] — [y, [z, z]], both [y, z] and [z, x] belong 
to Z(L) by definition, therefore, Z(L) satisfies the Jacobi identity. If Z(L) = L, then L 


is said to be albelian. 

Definition 3.3. The centralizer of a subset X of L is defined to be 
C1(X) = {x € L|[x, X] = 0} 

By the Jacobi, Cz, (L) is a subalgebra of LZ where C,(L) = Zr. 


Definition 3.4. Consider the Lie algebra L, having two ideals I and J, the following 
are ideals of L: 

I+J={ae+y|z elye J} 

UT, J] = {So eiyiles El,yi€ J} 


Proposition 3.5. Let L be a Lie algebra and let I, J be subsets of L. The subspace 
spanned by elements of the form x+y (a €I,y € J), is denoted I+ J, and the subspace 
spanned by elements of the form [x,y], (x € I,y € J), is denoted [I,J]. If 1,11, Io, J, K 


are subspaces of L, then 


tL. hh + Ip, J] CG [esc + [Io, J] 


3. [I,[J, K]] S [J (6, T]) + (6, (2, J] 
[Wan75] 


If J is an ideal of L, then the quotient space L/I (whose elements are the linear cosets 
L +I) induces the well defined bracket operation, where [x + I, y + I] = [x,y] + I for 
x,y € L [Sam90]. 
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Definition 3.6. By use of the bracket operation |-,-], the quotient space L/I becomes a 
quotient Lie algebra [Sam90]. 


Definition 3.7. If the Lie algebra L has no ideals except itself and 0, and if moreover 
[LL] £0, we call L simple. 


Example 3.1.2. Let L be a Lie algebra, and L = sl(n,F') where char F 4 2. It can be 


shown that L is simple if we take as a standard basis for L the three matrices 


We will begin by computing the matrices of ad,, ad;,, and ady. 
e ad,: 


ad,(x) = [x,24] =0=0-2+0-h+ 


O-y 
0 1 1 0O 1 O 0 
ad,(h) = [x,h] = ch —-ha = — 
0 O QO -l 0 -l 0 O 
QO -l 1 QO -2 
= = = =-24 =-2-r4+0-y+0-h 
0 0 0 0 O 
0 1 0 0 0 0 01 
adz(y) = [z,y] = xy — yx = = 
0 0 1 O 0 0 
0 0 1 O 
_ _ a =h=0-24+0-yth 
0 0 0 0 -l 
0 -2 0 


Therefore ady = 10 0 1 


e ady: 
pehnaies * 1G )() 
1 0 0 o/ \1 0 
_ fo 0 ca. 
a 0e 
ady(y) = [y,y] =O =0-x4+0-A+ 
men t oadtg 
(’ y [’ (’ ) 
= _ _ sy 
1 0 —1 0 2 0 
0 O 0 
Therefore ady=|}—-1 0 0 
0 O 2 
e adh: 


wocnewnol IE I-6IC 
-f)-63)-G) ate 


adp(h) = [h,h] =0=0-24+0-h+4+0-y 


snnanne( VOILE 
a 9) \- eS )) =a oe peaptiee 


2 0 O 
Therefore ad, =|0 0 O 


0 0 -2 


Let I #0 be an ideal of L where the eignevectors of x,y,h correspond to the distinct 
eigenvalues 2,—2 and 0 respectively (since char F #2). Let ax + by+che€TI be 
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arbitrary. If ad, is applied twice: 


ad, (ax + by + ch) = alaz, ax] + blz, y| + c[z, h] = 0+ bh — 2caxr 
ad,(0 + bh — 2ca) = b[x, h] — 2cla, x] = —2bx 


then —2bx € I. If ad y is applied twice: 


ady(ax + by + ch) = aly, x] + bly, y] + cly,h] = —ah + 0+ 2cy 
ady(—ah + 0+ 2cy) = —aly, h] + 2cly, y] = —2ay 
then —2ay € I. So ifa or b is nonzero, than either y or x is contained in I, thus I = L. 


Ifa and b are both zero, then ch € I where ch £0. Thus, again I= L. Therefore L is 
simple. [Hum72] 


Example 3.1.3. We can show that sl(3, F’) is simple, unless char F = 3, using the 
standard basis hy,h2, eij(t Aj). If 1A 0 ts an ideal, then I is the direct sum of 
eigenspaces for ad hy or ad hg. We will determine the eigenvalues of by afizing h, and 
hz to e;;, 1.e, take the adjoint using diagonal basis matrices. By definition of basis 


matrices, for sl(3, F), hy = e11 — €22 and hg = e22 — €33. 
ad, (€12) = [A1, e12] = hiei2 — e12h1 
= (e11 — €22)e12 — €12(€11 — €22) = e12 — (—e12) = 2e12 


Continuing this way, we construct the table below to denote the eigenvalues ad hy, ad hg 


acting on e;; 


[h1, e43] [ha, ex3] 
[h1, €12] = 2e12 (ho, (€12)] = —leie 
[hi, €21] = —2e21 (ha, (e21)] = lear 
[Ai, €13] = leis (ha, (e13)] = leis 
[A1, (es1)] = —legi [ho(e31)] = —le3; 
[Ai, (€23)] = —levs || [Aa(e2s)] = 2¢€23 
[hi, (e32)] = less [h2(e32)] = —2ez2 


Figure 3.1: Table of eigenvalues 


Therefore the eigenvalues are 2,1,—1,—-2 and 0, and since char F 4 3, the eigenvalues 


are distinct. If I is an ideal, then by definition, 


adhy(I) = [adhy, I] CI 
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To show that sl(3, F’) is simple, recall the dimension of sl(l +1, F) is 

1+(1+1)? —(1+1) =2+(2+1)? —-(2+1)=8. When dimI =1, I= Fz, but I is not 
an ideal. When dimI = 2, I is spanned by x and y, but I is not an ideal. Continuing 
this way, by the linearity of the adjoint operator, it is evident that I #0 is an ideal if 
and only if dimI = 8. Therefore s\(3, F') is simple. 


Definition 3.8. The normalizer of a subalgebra K of L, is denoted by N,(K), where 
Nz(K) = {x € L|[x,k] C K} ts a subalgebra of L. If K = Ni (Kk), we call K 


self-normalizing. 


In the normalizer, K is the largest ideal of L that absorbs L. If K is an ideal of L, then 
Ni(K) = L. 


Example 3.1.4. We can show that t(n, F) and o(n, F’) are self-normalizing subalgebras 
of gl(n, F), whereas n(n, F) has a normalizer t(n, F). Note: for all upper triangular 


matrices, the diagonal entries of the matrix are precisely it’s eigenvalues. 


e By definition of the normalizer, we must show 
Nz(t(n, F)) = {6 € gl(n, f)|[b, t(n, f)] € t(n, f)} where Nzi(t(n, F)) = t(n, f) is a 
subalgebra of gl(n, f). By the definition of structural constants, b = \oy_, bijkXk- 


n n 
[b, exk] = y bijkEkekk — Ekk y bijk@k 
k=1 k=1 


n n 
= y bijeijekk — y bijekkeij 


ij=l ij=l 
n n 

= So bigdjneu — S > digSuieey *- eajemt = Sjnen 
ij=l ij=l 


n n 
= S° bikeik — oS bejej € t(n, F) 
i=1 j=l 


Where >>; bixeix implies that by, =0, fori < k and ae byj€ky implies that 
bjx =0 for 7 <k. Therefore by = 0 for k <1 as required, and thus b € t(n, f). 


e To show that o(n, F’) is self normalizing, we must show: 


N_z(o(n, F’)) = {b € gl(n, F’)|[b, o(n, F)] € o(n, F)} where Nz (o(n, F)) = o(n, f). 


n n 
[b, €xk] = y bijkEkekk — Ekk S bijk@k 
k=1 k=1 


n n 
y  bizeigern — D_, dig ean€iy 


ij=l ij=l 
n n 

= So bizdjneu — S > digSuieng *- agent = Sjnen 
ij=l ij=l 


n n 
= s bikeik — ye brejekj € 0(n, F) 
i=1 j=l 


Where >>; bixeix implies that by, =0 fori #7 and Soyel byj€kj implies that 
by; =0 for 7 Ak. Therefore by =0 fork Al as required and thus b € o(n, f) 


which makes o(n, f) self normalizing. 


e To show that n(n, Ff’) has a normalizer in t(n, F’), let x € gl(n, F), x ¢ o(n, F), 
then: 


[x,n(n,F)] Z n(n, F) where 
n(n, F)t(n, F) Cn(n,F) and t(n,F)n(n, F) C n(n, F) 


Therefore [n(n, F),t(n, F)] C n(n, F). Let b Z t(n, F), so bj; AO wheni > j. 
[eji, b] = bs bijkejiLk — Se bijgkTRC ji 
k=1 k=1 


n n 
= Do bizejiexs — D7 biggest 


ij=l ij=l 
n n 
= So bigdisess — D7 big Sige 
ij=l ijal 
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= ae bij€55 — doi bien ZO Since its (j,j) entry is bj;. Therefore b is not in 


the normalizer of t(n, F). 


3.2. Homomorphisms and Representations 


Homomorphisms as they appear in ring theory articulate nicely in Lie algebra. Vector 


space homomorphisms are linear maps where the group action is abelian under addition 
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and preserves scalar multiplication. Given a Lie algebra L, the next few definitions 
allow us to visualize Lie algebra homomorphisms, and generalize the well known 


isomorphism theorems to vector spaces modulo I. 


Definition 3.9. A linear transformation ¢: L — L' is called a homomorphism if 
o((x, y]) = [b(x), o(y)], for all x,y € L. ¢ is called a monomorphism if its kernal is 
zero, an epimorhpism if its image equals L', and an isomorphism if ¢@ is both a 
monomorphism and epimorphism, that is, if d is bijective. 

Example 3.2.1. Let ~ be a homomorphism yp: L > L’, such that L,L' are Lie algebras 
over F’. We can show that Ker vy is an ideal of L. Letx € L ands € L. Now 


vls, 2] = (p(s), e(@)] = [e(s), 0] = 0 


The adjoint representation of a Lie algebra L is a homomorphism using the map ad: 
L = gl(V). By definition, ker ad=Z(L). If L is a simple Lie algebra, then Z(L) = 0, 
thus by Definition 3.9, ad; — gl(L) is a monomorphism, and therefore an invertible 
homomorphism, which proves that any simple Lie algebra is isomorphic to a linear Lie 


algebra. 


Definition 3.10. The special linear group SL(n, F) denotes the kernel of the 


homomorphism 
det: GL(n, F) — F* = {x € F\x £0} 
where F is a field. 


We will now forge a proof of Lie algebra isomorphism theorems analogous to the ring 


theory isomorphisms described in the introduction. 
Proposition 3.11. Let L and L’ be Lie algebras 


1. Ife: L> L’ is a homomorphism of Lie algebras, then L/Kerp = Img. If L has 
an ideal I, included in Kery, then the map y: L/I > L’ defines a unique 


homomorphism that makes the following diagram commute (7 = canonical map): 


f ieee Bi 


AG) 


L/I 
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2. Let I and J be ideals of L such that I Cc J, then J/I is an ideal of L/I and 
(L/I)/(J/1) is naturally isomorphic to L/J. 


3. Let I, J be ideals of L, then there exists a natural isomorphism between (I + J)/J 
and I/(IN J). 


Proof. 

1. Let y: L + L’ be a homomorphism of Lie algebras. Our first goal is to show 
given J is any ideal of L included in Kery, there exists a unique homomorphism 
w:L/I > L’. Let Ik, Il € L/I. To show w is a group isomorphism, note that 
j=l. If 7 € Il then y(j’l) = vy’) e() = (1), since y is a homomorphism. 
Therefore w is well defined. For all 7,k € I: 


PURI) = b(Lkl) = p(kl) = elke) = vdDYoUk) 


Therefore ~ is a homomorphism. Let kery = J, 


p(Uil) =1 y(l) =1 léekerp => lel s> Il=!I 


2. Fist note: if i €¢ IM J, then by definition of the intersection, [i, xz] € J and 
[t,x] € J, so [t,2] € 1N J. Therefore, IM J satisfies the definition of an ideal. 
Given x +I € L/I and j+I1 € J/I, we have: 


fe+J,j9+]) = [2,7] + [2,2] 4+ U9) 4+ 0,7] = [2,9] +0 € J/T 


Where [x, 7] € L. Therefore, J/I is an ideal of L/I. Additionally, given 
e+I,y+IeL/I,c+I=y+I mod J/I implies that (x — y)+J € J/I and 
therefore « — y= 7 € J. Thus x and y are equivalent mod J in L. 


3. Let 77+ 91,72 +72 C24 J. If iy +91 = 12 + jo mod J, then 
(i, —t2) =jo-—ji =j€ J, but i; —i2 € J. Therefore 7; —ig EC INS 


Example 3.2.2. Since n(n, F’) is in the kernal of t(n, F), by Proposition 3.11, we can 
show t(n, F)/n(n, F) = o(n, F). Given x,z € t(n, F),y € n(n, F), we have 


[(et+y), (2+y)] = [2,2] + [ey] + ly, 2] + yy] 
=[z,z])+y {Since y € n(n, F)} 


Where |x, z] € t(n, F). Any upper triangular matrix can be diagonalized, therefore 


[x,z] € o(n, F) and thus |x, z] + y € o(n, F). 
(ay) + (ety) =(e+z)+y 


Since the sum of any two upper triangular matrices is also upper triangular, 


(c+ 2) € t(n, F), and therefore (x +z) + y € o(n, F). 


Example 3.2.3. For small values of l, isomorphisms occur among certain classical 
algebras. We can show that A;, B,C, are all isomorphic. Additionally, we will show 
that By is isomorphic to C2, and D3 is isomorphic to A3. We can form our desired 
isomorphisms given the following root system decomposition for classical algebras 


An — Dy: 


Aj: 
ey; (t#3J) 
hp ey ewig. (haa <1) 
Ba 
Ci —ei4iiti (2<¢<1+1) 
enuin1 —4igai1 (1<i<Jd) 
€1ig1—eiitii (l<i<J) 
engi eng wag (est 47 <0) 
ealiea— Se lo 
eta epaias, (ej te) 
Cl: 


C4 — Cit (l<i<d) 
ej —engii (<i Aj <I) 


ee (ead) 


eiltg tee (LSi<j <I) 
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eg eaang- (ls4920 


Gig — egy Uses <9 <1) 


eugene (hst<g = 2 


[Hum72], [Wan75] 


e To prove A; ~ By, we must show sl(2, F) = 0(3, F). The diagonal basis matrix for 
Ay 18 €11 — €922- 
[e11 — €22, €12] = (€11 — €22)€12 — e12(€11 — €22) = €12 + €12 = 2e12 
[e11 — €22, e21] = (€11 — €22)e21 — €12(€11 — €21) — 2e21 


The following map enumerates the root system from A, to By, where sl(2, F’) is a 


2x 2 matrix and 0(3, F) is a3 x 3 matriz: 


€11 — €22 — 2(e11 — €22) 
€12 —> 2(e€13 — €21) 


€21 2(e12 = €31) 


Similarly, the following map signifies the root system from B, to C , where 
sp(2,F) is a2 x 2 matriz: 
2(€11 — €22) — €11 — €22 
2(e13 — €21) > e12 
2(€12 — €31) — €21 
Notice that the basis matrices for A, and C, are identical, therefore by the 
transitive property, the isomorphism between A, and C, is immediate. and 
Ay = Bi = Cj 
e For By = C2, we must show 0(5, F) = sp(4, F’) by adjoining the diagonal matrices 
from Bz,C>2 to corresponding eigenvectors to determine the eigenvalues involved in 
their respective basis matrices. The diagonal matrices for By are as followed: 
hy = €22 — €44 


ha = €33 — €55 


Denote 1 = [hi, x] and Ap = {ho, x], where x is an eigenvector, then eigenvalue 


A = (Aq, A2). Recall that [h;, x] = —[hj, 4] 


hy, €21 — €14 


ho, e21 — €14 
hy, €12 — €41 


ho, e21 — e414 


= hy (e21 — e14) — (e21 — e14)h1 


= (€22 — €44)(€21 — €14) — 


= (e33 — €55)(€21 — €14) — 


=e41-e12 =—1 


=0 


(€21 — €14)(€22 — €44) = €21 — C14 = 1 


(e21 — €14)(€33 — 655) = 0 
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Therefore, the eigenvalue for eigenvector e21 — e14 is (1,0) = a, and the eigenvalue 


for eigenvector e12 — e41 is (—1,0) = —a. The remaining eigenvalues are derived 
similarly: 
Aq = (hi, 2] AQ = (ho, x] A= (Az, A2) 
hy, €14 — €21 =i | hg, e14 — €21 =0 a = (1,0) 
[a e199 = ea | = = he, e12 — ea1] = —a = (—1,0) 
[hi, €32 — 45] = —1 |] [ha, e32 — €45] = 1 8 =(-1,1) 
hy, €23 — €54] =1 || [h2, €23 — e54] = —1 —8 = (1,-1) 
hy, €15 — €31] = ha, e15 — e31] = a+ 8 = (1,0) + (-1,1) = (0,1) 
hi, e51 — e13] = [ho, e51 — e13] = —1 —(a+ 8) = (-1,0) + (, -1) = (0,-1) 
hy, €25 — e34] = 1 hg, e25 — €34] = 1 2a + 8 = 2(1,0) + (-1,1) = (1,1) 
[hi, €52 — €43] = —1 |] [he, e52 — e43] = —1 (2a + 8B) = —[2(1,0) + (-1,1)] = (-1,-1) 


Figure 3.2: Table of eigenvalues for Bo 


The diagonal matrices for Cg are as followed, where | = 2: 


/ 
hy = e11 — €33, 


/ 
hy = €22 — €44 


Denote X‘, as the eigenvalue of {hi,,x] and X45, as the eigenvalue of [h5,x] where x is 
1 rn 2 2 


an eigenvector. Then eigenvalue ’ = (X4, A). 
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ri X5 N = (Ai, Ad) 
[hi, €21 = e34] =-l [hb, €21 = e34| = 1 al = (-1, 1) 
hi, e12 — e43| = 1 [hy, e12 — €43| =-l -—q' = (ae —1) 
[hi e13] = 2 [h5, e13] = 0 B= x0) 
ees e31| = —2 [h5, 31] = 0 — = (—2, 0) 
hi, e14 + €23] = 1 [h5, e14 + €23] = 1 a’ + 6’ = (-1,1) + (2,0) = (4,1) 
[fi €41 + €32] = —1 || [hp, e41 + e32] = —1 || —(e' + 6") = —[(-1, 1) + (2, 0)] = (-1, -1) 
hy, e51 — €13] =O || [hg,e51 — e13] = —1 || —(e' + 6’) = (-1,0) + 0-1) = (0,-1) 
[h4, €24| = 0 [A4, €24| = 2 2a’ + B' = 2(-1, 1) + (2, 0) = (0, 2) 
[h{, e42] = 0 [hy, e42] = —2 (2a + 6") = —[2(-1, 1) + (2,0)] = (0, -2) 


Figure 3.3: Table of eigenvalues for C2 
We can construct a linear transformation, where 
1 1 1 1 
= —5h + alte = 54 + aie 
Let the isomorphism between By and Cy be defined by the following map: 
ahleaH Late? Bia —pH, LAr <2 


Therefore, By = Co, where: 


€22 — e444 = —4(€11 — €33) + $(€22 — e144) 


€33 — 55+ 5 (€11 — €33) + 5(€22 — 44) 


2 
€21 — €14 +> ¥2 (ex — €34) 
2 
€12 — €41 +? V2 (ery 1643) 
€32 — €45 +> €13 
€23 — €54 +> €31 
2 
€15 — 63114 V2 (e14 + €23) 
2 
€31 — €41 +> V2 (ea + €32) 
€25 — €34 +> €24 
€43 — €52 +> €42 


e For A3 = D3, we must show sl(4, F) = 0(6, F). The diagonal matrices for A3 are: 


hy = e11 — €22, he =e€22—€33, hg = €33 — €44 


Denote 1 = [h1, 2], A2 = [h2, 2], and A3 = [h3, 2]. To find the eigenvalue that 
corresponds to eigenvector e€13, we have: 

(hi, €13] = (e11 — €22)€13 — €13(€11 — €22) = €13 

[ho, €13] = (€22 — €33)€13 — €13(€22 — €33) = —e13 

[h3, €13] = (e33 — €4a)€13 — €13(€33 — e44) = —€13 


Therefore eigenvalue X = (Aq, A2, A3) corresponding to e13 can be written as 


a = (1,-1,-1). 
The diagonal matrices for Ds are: 


/ / / 
hy =e11 —€44, hy =€22—€55, hz = €33 — C66 
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For D3, denote Xi, = [hi, 2’), Ay = [hb, v'], A5 = [h, 2’] where x’ is an eigenvector. 


Let N = (Ai, A$, 45). The eigenvalues for Az and D3 are listed on the following 


table: 
x A= (Az, A2, A3) x! OY = ( i. 3 Ag) 
€13 a(1, 1, -1) €26 — €25 al = (0,1, 1) 
€31 k= (-1, 1, 1) €62 — €53 a= (0, —1, —1) 
€24 B(-1,1,1) €23 — €65 B'(0,1, -1) 
€42 —(' = (1,-1, -1) €32 — €56 —8'(0, -1,1) 
C41 a (—1,0, —1) €12 — €54 7! = (Ge —1, 0) 
e14 7 = (1,0,1) €21 — €45 —~7'(-1,1,0) 
€43 (a+) = (0,1, —2) €16 — €34 (a’ + 7) = (1,0, 1) 
34 —(a+-y) = (0,-1,2) e61 — e43 || —(a’ +7) = (1,0, 1) 
€21 B+y7= (-2,1,0) €13 64 B' +7 = (1,0,-1) 
€12 —(8 +7) = (2, -1,0) €31 — €46 =(Praey) = (—1,0,1) 
€23 at+B+7=( Le, 1) €15 — €24 a’ + 8’ +7 = (1,1,0) 
€32 —(a+ B+) = (D2; 1) €51 — €42 al + B+ = (1, 1,0) 


Figure 3.4: Table of eigenvalues for A3 and D3 


We can construct a linear transformation: 


H,=—h, +h, Hi=h,+hy, H,=-h, —hy 


The insomorphism between A3 and D3 is thus obtained by defining a map between 


diagonal matrices; 


a(hi) = a!(Hj), Bhi) = 0!(Hj), (ha) = (Ay) 


Therefore A3 = D3, where: 


€11 


€22 


-> 


€22 — €33 t> 


€33 


(e11 — e44) + (€33 — €66) 


(€11 — €44) + (€22 — €55) 


€44 
€13 > 
€31 > 
€24 > 
€42 +> 
€41 +> 
e€j4 > 
€43 +> 
€34 +> 
€21 > 
€12 +> 
€23 +> 


€32 +> 


3.3. Automorphisms 


> 


(€11 — e44) — (€33 — €66) 


€26 — €25 
€62 — €53 
€23 — €65 
€32 — €56 
€12 — €54 
€21 — €45 
€16 — €34 
€61 — €43 
€13 — €64 
€31 — €46 
€15 — €24 
€51 — €42 
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The set of inner automorphisms of a ring, or associative algebra A, is given by the 


conjugation element, using right conjugation, such that: 


Given x,y € A: 


ya: ATA 


Ya(z) 


=a ‘ra 


1 


Ga(xy) = a *(ay)a = a~*waaé "ya = (a~*wa)(a~"ya) = yal) Pa(y) 


Where, (q is an (invertible) homomorphism that contains y,'. Therefore y, constitutes 


an isomorphism onto itself. Since the composition of conjugation is associative, a~ 


often denoted as x”. 


1 


xa is 


Definition 3.12. An automorphism of L is an isomorphism of L onto itself. Aut L 


denotes the group of all such. 


38 


Example 3.3.1. Consider L C gl(V), and g € gl(V) where g is an invertible 


1 


endomorphism. Define a mapping @: L — L’ where xLa~! = L, then x > gxg~', where 


@ is an automophism of L. The same can be said for s\(V) since the trace of a matrix is 


invariant under a change of basis. 


Example 3.3.2. Let L be a Lie algebra such that L = sl(n, F),g € GL(n, F). The map 
y:L—L defined by x > —gatg-!(xt = the transpose of x) belongs to Aut L. When 


n = 2, g = the identity matrix, we can prove that Aut L is inner. 


tr(—ga'g”') = tr(—gg"'a*) = tr(—at) = —tr(x) {Since g = the identity matrix} 


Sar (a) = 0: tr(—gatg ') =0 


Therefore, the map is a linear automorphism of sl(n, F’). If we apply the transpose to 


the commutator, for x,y € L, we have: 


[z, yl’ = (zy — yx)’ = (xy) — (yx)! 


= ytat — gty? = [ye] 
Therefore: 


t—1 


y[z,y] = —glz.y]'9~* = -gly’, "Jo 


{ By properties of the transpose} 


= g(y'a’ — a'y')g* = gy'z'g | — ga'y'g” 


1 
= [gy’g*, gr'g*] = gy'g ‘ga'g”' — gx'g 'gy'g* 
= [y(z), o(y)] 


Therefore, y is a homomorphism. Thus Aut L is inner. 


Example 3.3.3. An automorphism of the form exp(adz,), with ad, nilpotent, i.e., 


(ad,)* =0 for some k > 1, is called inner. 


To derive the power series expansion for exp(ad,), we begin by constructing the Taylor 
series expansion using the differential operator for matrix A. Given an invertible matrix 


A, the solution to 2’ = A# might be 2/(t) = Ave*’. Multiplying by A, we have: 


Ax(t) = Ave 
where Azx(t) = 2/(t) S \v = Av 
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where \ is an eignenvalue of A with corresponding eigenvector v. Then the power series 


expansion to define e** when \ is complex becomes: 


At _ | 1 24 | 1 n 


[Eve66] Let charF = 0, and x € L such that ad, is nilpotent. That is, (ad,)* = 0 for 


k > 0. Then, since ad, is finite dimensional, the above series expansion generalizes to: 


id ad‘, 
exp(ad,) = iI 
k=0 

(adx)* | (adz)° (ad,)*~" 

a a | (k — 1)! 


where exp(ad,) € AutL. To show this, recall that Leibniz’s rule for the product of 


derivations is: 


Tew) = FF Oey) 


Where 6 is an arbitrary nilpotent derivation of LZ. Given x,y © L: 


n-1 bi ~ n-1 5d 
Se 


j=0 


exp(d(a) )exp(5(y)) 


k— nm 
7 s 6” (ay) 
n=0 nl 
k-— 
_ O"(ay) 
7 n! 
n=0 


Where the first equality holds because exp(adz) = )77°.9 4) and the second equality 
satisfies Leibniz’s product rule. Since 6 is a nilpotent derivation of L, the last equality 
holds because 6* = 0. Therefore, the composition of two inner automorphisms is again 
an inner automorphism, thus the derivation of an inner automorphisms eap(ad()) is a 


homomorphism: 


leap(d(x)), exp(O(y))] = exp([d(x), 6(y)]) 


Where exp 6 is invertible by exhibiting the inverse: 1 —n+n?—n°+...+n'~!, exp 


d6=n-+1. Thus, exp(ad,) € AutL. The subgroup of all such inner automorphisms 
exp(ad,) is called Int(L) and defines a normal subgroup of Aut(L). 
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Definition 3.13. When ad, is nilpotent, the inner automorphism constructed is called 


Int L. For ¢@€ Autlve L, 


d(ad,)o = ad g(x) when gexp(ad,)o— = exp(adg(z)) 


Example 3.3.4. Let a be the automorphism of sl(2, F’) give by the following: let 

L € sl(2, F), with standard basis (x,y,h). Define o =exp ad x- exp ad (—y): exp ad x. 
We can show that o(x) = —y,o(y) = —z,0(h) = —h. From example 3.1.2, we have 
[2,4] = h, [h, y] = —2y, [h, x] = — 2a 


a(x) = exp adx - exp ad(—y) - exp adx(x) 


ad(—y)? 
ey 


= exp adx- (1+ ad(—y) + 


= exp ade (2 + (—ady)(x) + 5, (ad(—y))*(2)) 
= exp ade (2 ~ [y,2] + 5 (ad(—y))(h)) 


= exp adx - (x +h+ [—y, Al) 


ad(a)? 


2! 


= (1+ adr 4 \(a+h—y) 


[x[x,x]]  [x,-22]  [x,h] 
2! 2! 2! 


=(a@+h—y) + ([2, 2] + [2, h] — [2,y]) + ( 


= (+h—y) + (-20—h) + (2) 


o(y) = exp adx - exp ad(—y) - exp adx(y) 


ad(x)? 
exp adx - (1+ adx 4 ae) (y) 


exp adx - (y+ [x,y] 4 Ie, ie, 9] 


= exp adx - (1+ ad( y) += 


\y+h—z) 


= exp adx-[(y+h—z)+([-y,y] + [-y, Al — [-y, z]) 


—y,[—y, —y,|-y,h —y,|—-Yy,&@ 
, (Eek wall, Ey yl uF wally 
= exp adz-(y+h—2y—ax—h+y) 

ad(x)? 

(a) 


= (1+ adz 4 


o(h) = exp adx - exp ad(—y) - exp adx(h) 
ad(x)? 


= exp adx - exp ad(—y) - (1+ ada + or (A) 

= exp adx - exp ad(—y) - (h + {x, h] 4 fe 
ey. 

= exp adz - (1+ ad(—y) + Ga Ys) )(h — 22) 


= exp adx - [(h — 2x) + ([-y, h] — 2[-y, a]) + (ee =, Al], 2[-y, -y, al] 


= exp adx - |(h — 2x) + (—2y — 2h) + (04+ 2y)] 

= exp adx(—h — 22) 

ad(x)? 
! 


= (1 + ada + —— )(—h — 22) 


= (—h — 2x) + (—[x, h] — 2[x, x]) + ( 


Al 
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Chapter 4 


Solvable and Nilpotent Lie 
Algebras 


4.1 Solvability 


In this section, we will decompose FL into a collection of subalgebras. Given D as the 
direct sum of ideals, the derivation of L’ for i = 1,2...n, consists of a mapping such that 


the commutator [L’, L"] is zero. 
Definition 4.1. The derived series of a Lie algebra L is a sequence of ideals of L where 
P= Eo) = (6h), S(O) Se. 
Given a Lie algebra L, and an ideal I, we have: 
Lry=(L0N CLO CU0+U,=2' [Wan75] 


Therefore, if I is an ideal of L, then so is I'. Unless otherwise stated, the material for 


this chapter is derived from [Hum72]. 


Example 4.1.1. Given I is an ideal of L, we can show I"*! is an ideal of L using 


induction. 


Proof. Since I is an ideal of L, I° = I is also an ideal of L, as is I‘ by the above. 


Assume J” is an ideal of DL. Let x € L,y,z EI”. 
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[x[yz]] = —[y[za]] — [z[xy]] {By the Jacobi identity} 
€ [y, I") + [z, 2" {Since [z, 2], [x,y] € I”} 
€ (17,1) + 1", 2") {Since y, z € I”} 
eyetiy jt 
err 


Therefore J”* is an ideal, making each member of the derived series of J an ideal of 


L. 


Definition 4.2. A group G is said to be solvable if it has a subnormal series 


G=Go>G,>Go>...>Gnr=e (4.1) 
in which the factors G;/Gi41 are all Abelian for alli,O<i<n-1. 


By subnormal, we mean that for each 7, 0 <i<n-—-—1, G; is normal in G;,,. Given a 
Lie algebra L, and series length n > 0 we say that L is solvable if L( = 0, that is, a Lie 
algebra is solvable if its derived series terminates in the zero subalgebra. The derived 
subalgebra of a finite dimensional solvable Lie algebra over a field of characteristic 0 is 
nilpotent, thus making abelian algebras solvable and conversely simple algebras 


nonsolvable. 


Lemma 4.3. Let0 > J > L > I -— 0 be an exact sequence of Lie algebras. Then L is 
solvable if and only if both J and I are solvable. [Sam90/ 


Proposition 4.4. 


1. If L is a solvable Lie algebra, then so are all of the subalgebras and homomorphic 


images of L. 


2. If I is a solvable ideal of L such that the quotient L/I is solvable, then L itself is 


solvable. 


3. If I, J are solvable ideals of L, where L is a solvable Lie algebrra, then I+ J is 


solvable. 
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Proof. 


1. Let K be a subalgebra of L, then by definition, K® Cc L, therefore K is 
solvable for all 7. If we define a map y: L — M where y is an epimorphism, then 


by induction, we can show y(L®) = M®, 


2. Let (L/I)\™ =0. By construction of the canonical homomorphism 7 : L > £, we 
have that 7(L‘)) = 0 by part 1. or L™ Cc I =kern. If 1 =0, then 
(LO)@ = L449) = L+™) — 0 (since L™ Cc I). 


3. By Proposition 3.11 (part 3.): if J, J are ideals of L, then from the sequence 


Cf Sia 2 va > 0, there exists a natural isomorphism between the third 
term (I+ J)/J and I/(IN J). Now (1+ J)/J is solvable by the Lemma 4.3, since 
the homomorphic image of I is solvable. Therefore [/(IM J) is solvable by part 2. 


As a consequence of Proposition 4.4, if D is solvable, then L possesses a unique maximal 
solvable ideal, called the radical of L, or Rad L. When L is semisimple, Rad L = 0. 
Recall, L is simple if it contains no non trivial ideals (i.e., only contains the trivial ideals, 


0 and LF itself), that is, if Rad L = 0. Therefore, every simple algebra is semisimple. 


Example 4.1.2. L is solvable if and only if there exists a chain of subalgebras 


L=Ipj>D 11>... Dd Ly =0 such that: 
e Li41 ts an ideal of L;. 


is abelian. 


e Each quotient ri 


Proof. 
=> 


e Since L is solvable, there exists a sequence of ideals of DL such that 
LDL D...D L® =0 for some i. By definition, each ideal forms a subalgebra 
of L. By Proposition 4.4, L@+) C L© where L“+) and L© are ideals of L. 


e Since L(+) is an ideal of L® C L, Lt” is an ideal of L. Since L is solvable, 


faut ‘ * (@) 
then by definition of the subnormal series, we can show each quotient rea is 
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abelian. By definition of the derived series, L°+) = [L©, L]. Let [x,y] € LD, 
with z,y € L©: 


[2 + LOD y+ LOY) = fe, y) t fe, LOY] + [LOY y] 4 [OY OY] 


oad Ee y] ele ps) 


L® . ; 
Therefore TaD is abelian. 


< Given vo is abelian and L = Lg D L, >... D Ly = 0, then the derived series for DL 


terminates in the zero subalgebra. Therefore, by defintion, Ly is solvable, making wo 


solvable, and thus Lz; is solvable by Proposition 4.4. Similarly, ise 


; is solvable, and 


thus Ly_2 is solvable. Continuing this way, ya is abelian and thus Lj is solvable. 


Therefore Lo = L is solvable. 


Example 4.1.3. We can show that L is solvable if and only if adz is solvable. 


Proof. 
= Let L™ =0. By induction, when n = 0, L© = L, and ad;,.o) = ady = (adz). 


Therefore ad is solvable for n = 0. Assume adj(n) = (adz)™, 


ady (nti) = ad[L™, LW] {By definition of the derived series} 
= [(adp)™, (adz)™] {Since adj,) = [adz, ady] } 
= (adz)"*») {Since L™+! = [L”", L"]} 


Therefore adz is solvable. 
< Since ady is solvable, (ad,) = ad” = 0, by the induction above. Therefore 


L™ C Z(L). Therefore Za) is solvable and thus L is solvable. 


4.2 Nilpotency 


In this section, we shall explore the descending central series, defined as a recursive 
sequence of subalgebras that ends in the zero subspace. We shall rewrite this series 
using the quotient space to determine the effect of having a nilpotent algebra as it 


relates to it’s subalgebras and their homomorphic images. 
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Definition 4.5. The descending central series is a sequence of ideals of L defined 
as D9 =L, 01 = [LI], L? =[LL4],...,L° =[LL*"1]. We call L nilpotent if for some 
n>0, L”=0. 


Thus L’ is spanned by long brackets [Xj[Xo[...X741]...] (which we abbreviate to 
[X1X9...X;41]) [Sam90]. Notice that L” = 0 makes any abelian algebra nilpotent. 
Therefore nilpotent implies solvable since L C L’ for all i. However solvable does not 


imply nilpotent. 


Example 4.2.1. The algebra n(n, F’) of strictly upper triangular matrices is nilpotent. 
A matrix is strictly upper triangular if and only ifi > j7—k. So Given BE = 0, implies 
Bk =0 fork >m if B is anmxm matrix. Using induction, we have B! = 0 for 


i.j —1. Assume BE. = 0 fori,j —(k-—1): 


a m 
Bh = BBE! =~ BABE + So By BES 
fH=1 r=i+1 


=Sro- Bit 4 - Bir -0 =0 
ra 


r=i+1 
Note that the first term goes to zero, since r <1 where B;, is strictly triangular. The 
second term goes to zero sincer >it+1>(j—k)+1=j-—(k-—1). Therefore, by the 
induction step, BE. =0. 


We can show a similar result for lower triangular matrices using the following indices: 


m>ji>l Cy = Diz =0 


m>i>l Cy = Dex =0 
Which would prove that (CD);; = 0 [Eve66/ 
Proposition 4.6. 


1. If L is a nilpotent Lie algebra, then all the subalgebras and homomorphic images 


of L are nilpotent. 
2. If L is a Lie algebra such that L/Z(L) is nilpotent, then L is nilpotent. 
3. If L#0 is a nilpotent Lie algebra, then Z(L) #0 


Proof. 


AT 


1. Let K be a subalgebra, then by definition K* C L’. Similarly, if ¢: L + M is an 


epimorphism, we can show, by induction on 3, that y(L’) = M* 
2. Let L” Cc Z(L), then 


|b aaa ol Ue Bd {By definition of the descending central series} 


=f, 27(L)|=0 {By definition of the center} 


3. If L is nilpotent, L” = {0}, and L"~! ¥ {0}. By definition of the descending 
central series, L”~' = [L, L"], which implies that L”~! C Z(L). Therefore 
Zi ds). 0; 


Example 4.2.2. Let I be an ideal of L. Then each member of the descending central 


series of I is also an ideal of L. 


Proof. Since I is an ideal of L, I° = I is also an ideal of L. Assume I” = [I,I"~"] is an 


ideal of L. Let xe L,yel,ze lI”. 


(z[yz]] = —[y[za]] — [z[xy]] {By the Jacobi identity} 
E [y, 1”) + [z, J] {Since [z, x] € I”, |x, y] € I} 
€ [7,7] + (0°, 0] {Since y €I,z eI} 
= prt y pnt 
= prt 


Therefore J”*" is an ideal, then each member of the descending central series of I is an 
ideal of L. 


Example 4.2.3. We can show that L is nilpotent if and only if adz is nilpotent. 


Proof. Using induction, let n = 0, then L° = L. Therefore ado = (adz)° = ad. Let 
adzn = (adz)". Then 


adzpn+i = (ad, adzn] {By definition of the descending central series} 
= {ad_, (adz)”] {Since adpn = (adz,)"} 


= (ad,)"*" 
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= Let L” = 0 for some n. By the above induction, adzn = (adz)” = 0 and thus adz is 
nilpotent. 


< Given adz is nilpotent, (adz)" = 0. Since (adz)” = adn = 0, L” = 0, therefore L is 


nilpotent. 


Example 4.2.4. The sum of two nilpotent ideals of a Lie algebra L is again a nilpotent 


ideal. Therefore, L possesses a unique maximal nilpotent ideal. 


Proof. Let I, J be nilpotent ideals of L. By definition, 1”, J” = 0 for some m,n. Let 
n>m. Let « € I,y € J, then: 
[z+y,¢+y] = [v2] + [x,y] + [ye] + [y, 9] 
CU, +7] + [42] +[4, J] 


=([L0+([,/) +157] 


CU, ]+[J, J) tINJ=[[45,1+ J] [Sam90] 


Assume by induction, (I+ J)* C 17+ J™ +I J. Our aim is to show the there exists a 
k such that (I + J)* = 0 where (I+ J)* = [I+ J, (I+ J)*“1], by definition of the 
descending central series. Let k = n+ m, by the use of binomial expansion: 
(I+ J ® = (+ Arr are gee dy tm ty gm 
=O 70 Jeep ihr 20 {Since lJ" = 0h: 


Crags nd 


where (I+ J)*¥ = (1+ J)"*™ =0. Thus J 4+ J is a nilpotent ideal. 


Example 4.2.5. Let charF = 2. We can prove that L = sl(2, F’) is nilpotent. Recall the 


standard basis for L: 


1 O 1 0 0 
w= Yr = 
0 -l 0 O 1 O 
ie z] = [(e11 aa €22), e21] = (e11 = €22)(€21) =. (e21)(€11 = €22) = —e€21 — €21 = —2€21 = —2z 


[x,y] = [(€11 — €22), €12] = (€11 — €22)(€12) — (€12)(€11 — €22) = €12 — (—e12) = 2e12 = 2y 


[y, z] = [e12, €21| = €12€21 — €21€12 = €11 — €22 = & 


A9 


Therefore, by definition of the lower central series, 


QO —-2y 2z 
20, FP) sO. )) = | By. O. Se 
—22  & 0 
Recall |x, 2] = 0. Since charF = 2, then |z,z| =0='|z; yly and 
0 0 O 
[sl(2, F),s((2,F)) = ]0 0 -a | =F x. Therefore by the Jacobi relation, 
0 xz 0 


L? = [s\(2, F), [sl(2, F), s1(2, F)]] = [sl(2, F), Fx] =0. 


4.3 Proof of Engel’s Theorem 


The purpose of Engel’s Theorem is to connect the property of Lie algebra nilpotence to 
operators on a vector space. A Lie algebra is nilpotent if for a sufficiently long sequence 
{xi} of elements of L, the nested adjoint ad(ap)[...ad(x2)[ad(x1)[y]]] is zero for all y € L. 
That is, applying the adjoint sufficiently many times will kill any element belonging to 
L, rending each x € L ad-nilpotent. Engel’s theorem states the converse as true. In 
preparation for Engel’s theorem, we must first observe the following lemma, then prove 


Theorem 4.8. 

Lemma 4.7. Let A be a nilpotent operator on a vector space V, then 
1. There exists a non-zero v € V such that Av = 0. 
2. ada is a nilpotent operator on gl(v). 


Theorem 4.8. Let L be a subalgebra of gl(v), with V finite dimensional. If L consists 
of nilpotent endomorphisms and V #0, then there exists a nonzero v € V for which 


Lv=0. 


Proof. Using induction on dimL. If dim L = 0, then clearly D.v = 0 and the theorem is 
true. If dim D = 1, then recall that a single nilpotent linear transformation contains at 
least one eigenvector corresponding to eigenvalue 0, thus the theorem is true and 


Lv=0. 
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Given L is a nilpotent Lie algebra, L(,) = 0 for all 1,...,a,2 € L. Let ad, € gl(L) 
with x € L ad-nilpotent, then: 


[eas oe em cree P| =0 
& (ads, \ads. icldde, Ge) = 0 


& (adz)” =0 


[GG78] 

Suppose Kk # F is any nontrivial subalgebra of L. Consider the vector spaces of AK and 
L. Since K is a subalgebra of a nilpotent endomorphism, ad;(/ + K) = ad;,(l) + K isa 
well defined map where, for k’ € K: 


ady(l+ k’ + K) = ad;(l) + ad, (k’) + K 
= ad;,(1) +k 


Thus ad, induces a nilpotent linear transformation on the quotient subspace L/K. 
Since K is a subalgebra of L, dimk < dimL. From the induction assumption and 
Lemma 4.7, it follows that there exists some non-zero vector +k #k in L/K such that 
x +k is killed by the image in gl(L/K). By definition, Nz (AK) = {x € L|[x,k] C Kk}, 
therefore K is properly in the normalizer of K in L since for x € L,y € K, [x,y] CK 
where x ¢ K. Let K be a maximal proper subalgebra of L, then every x € L has 

[x,k] € K for all k € K, which makes N;(/) an ideal of L. Consider the dimension of 
L/K. If dimL/K > 1, then there exists a 1 dimensional subalgebra Fu/K C L/K. By 
the correspondance theorem, K C F'v C L, but L is not one dimensional. Therefore, K 
has codimension 1, which makes L/K one dimensional. Since L/K is 1-dimensional, 
Span{l € L} = F(z+L)K for some nilpotent endomorphism z € L. For each | € L, 
1+ K=az+K,keK. Therefore az € K, and thuysl=az+k for zel—k. 


By induction, W = {v € V|k.v = 0} £0. By the above, K is an ideal of L. If 


xe L,y€ K,w €V, then since yx — ry = —|x, y] in End (V), we have: 
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(yx)-w— (xy): w= —[z,y]-w 


(yx) -w = (xy): w—[z,y]-w 


= r(y(w)) —0-w {since [x,y] is an ideal of k} 
=p {since (y(w)) =0,y € kK} 
=0 


Since z is a nilpotent endomorphism in L, L stabilizes W, therefore, z: W — W is 
nilpotent. For 2” = 0 (z"-! 40), there exists a u € V such that 2”~!u 4 0. Let 

vo = 2” tue W. Then zvp = z” = 0, Thus, there exists an eigenvector vp € W(v 4 0) 
for which K(v,) = 0. Therefore, L.v = 0 since L= K+ Fz. 


Theorem 4.9. Engel I Let V be a vector space; let L be a sub Lie algebra of the 
general linear Lie algebra g\(V), consisting entirely of nilpotent operators. Then L is a 


nilpotent Lie algebra. 


Proof. By Theorem 4.8, if we apply the dual representation (inverse transpose) of LZ on 
V*; the operators are nilpotent. Let 4 4 0 be a function on V that is anulled by L, then 
the space (L- v) spanned by all Xv with X € L, is a proper subspace of V, and in fact, 
is in the kernal of \ where \(Xv) = X‘A(v) = 0. Recall (LZ - v) is invariant under L, so 
we can iterate the argument. Let m = dimV, then the abbreviated iteration 
X1-X9...Xm, vanishes, since for every X;, the dimension of V is decreased by at least 1 
where any long bracket [X 1X ...X;] expands by the bracket operation into a sum of 
products of kX’s. [Sam90] 


Theorem 4.10. Engel IT If all elements of L are ad-nilpotent, then L is nilpotent. 


Proof. Given a Lie algebra of LZ having only ad-nilpotent elements, since the adoint is a 
linear transformation, adz C gl(L), which satisfies the hypothesis of Theorem 4.8 when 
L #0. Thus there exists an x € L such that [L,x] = 0, which implies that the center of 
L is nonzero. Therefore L/Z(L) consists of ad-nilpotent elements, where dim 
adyjz(z) < dimL. Using an induction argument on the dimension of L, similar to 
Theorem 4.8, it follows that L/Z(L) is nilpotent. Therefore, by Proposition 4.6 (part 
2.), if L/Z(L) is nilpotent, then so is L. 
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Corollary 4.11. Let L be nilpotent, and K be an ideal of L. Then if 
kAO KAZE). £0 


Proof. Since K is an ideal of L, L induces a linear transfomation on K via the adjoint 


representation, therefore there exists x € K(k #0). Therefore [L, x] = 0 by definition of 


nilpotency, and thus x € KM Z(L) as desired. 
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Chapter 5 


Conclusion 


Now that we have covered the basics, we can further explore the root system of the 
classical Lie algebras. The full set of roots of a Lie algebra L can be generated from the 
set of simple roots and their associated Weyl reflections. If we further explore nilpotent 
subalgebras of a Lie algebra that are self-normalising, we invariably must investigate 
cartan subalgebras and the killing form. 

Differentiating a Lie group action yeids a Lie algebra action. A Lie algebra is a 
linearization of a Lie group action, and allows one a doorway to study Lie groups. The 
foundation of Lie theory is the exponential map relating Lie algebras to Lie groups, 
known as the Lie group-Lie algebra correspondence. We briefly touched on this map 
earlier in section 3.3, dealing with Lie algebra automorphisms. 

The question arises: what real world applications do Lie algebras possess? Classical 
Lie algebra representations have applications in physics. They can be used to study 
fluid dynamics. Symmetries in physics take on the structure of mathematical groups, 
where continuous groups represent Lie groups. Collisions of vector bosons in quantum 
field theory is tied to Lie algebra, specifically gauge theory, and as such, becomes a 
mechanism to interpolate physical interactions. The algebra can be promoted to a 
group and interpreted as a symmetry (gauge symmetry). Lie algebras are thus integral 


to describing theories of nature. 
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